Efficient Seeds Computation Revisited
Authors
Abstract
The notion of the cover is a generalization of a period of a string, and there are linear time algorithms for finding the shortest cover. The seed is a more complicated generalization of periodicity: it is a cover of a superstring of a given string, and the shortest seed problem is of much higher algorithmic difficulty. The problem is not well understood, and no linear time algorithm is known. In this paper we give linear time algorithms for some of its versions: computing the shortest left-seed array, computing the longest left-seed array, and checking for seeds of a given length. The algorithm for the last problem is used to compute the seed array of a string (i.e., the shortest seeds for all the prefixes of the string) in O(n^2) time. We also describe a simpler alternative algorithm that efficiently computes the shortest seeds. As a by-product we obtain an O(n log (n/m)) time algorithm that checks whether the shortest seed has length at least m and finds the corresponding seed. We also correct some important details missing in the previously known shortest-seed algorithm (Iliopoulos et al., 1996).
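To make the two definitions concrete, here is a minimal brute-force sketch in Python. It is not one of the paper's algorithms: it runs in polynomial rather than linear time, and the function names (`is_cover`, `is_seed`, `shortest_cover`, `shortest_seed`) are hypothetical, chosen only for illustration. It assumes the usual definitions: a cover of s is a string whose occurrences in s cover every position of s, and a seed of s is a factor of s that covers some superstring of s.

```python
def is_cover(w, s):
    """True if the occurrences of w in s cover every position of s."""
    if not w or len(w) > len(s):
        return False
    covered_up_to = 0  # positions [0, covered_up_to) are covered so far
    for i in range(len(s) - len(w) + 1):
        if s.startswith(w, i):
            if i > covered_up_to:
                return False  # gap before this occurrence
            covered_up_to = i + len(w)
    return covered_up_to == len(s)


def is_seed(w, s):
    """True if w is a factor of s and w covers some superstring of s.

    A placement of w at offset i (possibly overhanging either end of s)
    is compatible if it agrees with s wherever the two overlap.
    """
    m, n = len(w), len(s)
    if m == 0 or w not in s:
        return False
    covered_up_to = 0
    for i in range(-(m - 1), n):
        lo, hi = max(i, 0), min(i + m, n)  # part of s under this placement
        if s[lo:hi] == w[lo - i:hi - i]:
            if lo > covered_up_to:
                return False  # no later placement can fill this gap
            covered_up_to = max(covered_up_to, hi)
    return covered_up_to == n


def shortest_cover(s):
    """Shortest prefix of s that is a cover of s (naive search)."""
    for length in range(1, len(s) + 1):
        if is_cover(s[:length], s):
            return s[:length]
    return None


def shortest_seed(s):
    """A shortest factor of s that is a seed of s (naive search)."""
    for length in range(1, len(s) + 1):
        for start in range(len(s) - length + 1):
            w = s[start:start + length]
            if is_seed(w, s):
                return w
    return None


print(shortest_cover("abaab"))  # abaab
print(shortest_seed("abaab"))   # aba
```

For the string "abaab", the only cover is the string itself, whereas "aba" is a seed because it covers the superstring "abaaba". This gap between the two notions is what the abstract refers to when it calls the seed a more complicated generalization of periodicity.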
Similar resources
[hal-00742061, v1] Efficient Seeds Computation Revisited
Sweep Line Algorithm for Convex Hull Revisited
The convex hull of a set of given points is the intersection of all convex sets containing them. It is used as a primary structure in many other problems in computational geometry and in other areas such as image processing, model identification, geographical data systems, and triangular computation of a set of points. Computing the convex hull of a set of points is one of the most fundamental and imp...
Gromov-Witten invariants of Fano hypersurfaces, revisited
The goal of this paper is to give an efficient computation of the 3-point Gromov-Witten invariants of Fano hypersurfaces, starting from the Picard-Fuchs equation. This simplifies and to some extent explains the original computations of Jinzenji. The method involves solving a gauge-theoretic differential equation, and our main result is that this equation has a unique solution.
Fast Computation of Good Multiple Spaced Seeds
Homology search finds similar segments between two biological sequences, such as DNA or protein sequences. A significant fraction of computing power in the world is dedicated to performing such tasks. The introduction of optimal spaced seeds by Ma et al. has increased both the sensitivity and the speed of homology search and it has been adopted by many alignment programs such as BLAST. With the...
Article Pre-print Quantisation Revisited: A Mathematical and Computational Model
A nascent theory of near division is presented, from which an efficient quantisation algorithm for rhythm intervals is derived. Based on a number theoretic analysis of the uniqueness and convergence of this first algorithm, a generalised algorithm is presented. An empirical study of the algorithm’s performance reveals a readily computable criterion within which the perceived ratio may reliably ...